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The benefits of a recently proposed method to approximate hard optimization problems are demon- 
strated on the graph partitioning problem. The performance of this new method, called Extremal 
Optimization, is compared to Simulated Annealing in extensive numerical simulations. While gener- 
ally a complex (NP-hard) problem, the optimization of the graph partitions is particularly difficult 
for sparse graphs with average connectivities near the percolation threshold. At this threshold, the 
relative error of Simulated Annealing for large graphs is found to diverge relative to Extremal Opti- 
mization at equalized runtime. On the other hand, Extremal Optimization, based on the extremal 
dynamics of self-organized critical systems, reproduces known results about optimal partitions at 
this critical point quite well. 



I. INTRODUCTION 

The optimization of systems with many degrees of free- 
dom with respect to some cost function is a frequently en- 
countered task in physics and beyond jjj . In cases where 
the relation between individual components of the sys- 
tem is frustrated such a cost function often exhibits 
a complex "landscape" || over the space of all configu- 
rations. For growing system size, the cost function may 
exhibit an exponentially increasing number of unrelated 
local extrema separated by sizable barriers which makes 
the search for the exact, optimal solution usually unrea- 
sonably costly. Thus, it is of great importance to develop 
fast and reliable methods to find near-optimal solutions 
for such problems. 

The observation of certain physical processes, in par- 
ticular the annealing of disordered materials, have lead 
to general-purpose optimization methods such as "simu- 
lated annealing" (SA) (|||. SA applies the formalism of 
equilibrium statistical mechanics and in principle only re- 
quires the cost function as input. Thus, it is applicable to 
a variety of problems. But the performance of SA is hard 
to assess in general, even when limited to the standard 
combinatorial optimization problems. Aside from a mul- 
titude of adjustable parameters that crucially determine 
the quality of SA's performance in a particular context, 
typical combinatorial optimization problems themselves 
possess various parameters that may change the land- 
scape and SA's behavior drastically @|. 

In this paper we will explore the properties of a new 
general-purpose method, called Extremal Optimization 
(EO) 0, in comparison with SA. In contrast to SA, EO 
is based on ideas from non-equilibrium physics. As the 
basis for comparison we will use the graph partitioning 
problem (GPP), a standard NP-hard combinatorial opti- 
mization problem [§J with similarities to disordered spin 
systems. We find that the GPP has a critical point as a 
function of the connectivity of graphs, with a less com- 
plex phase at lower connectivities. This critical point is 
related to the percolation transition of the graphs. Near 



this critical point, the performance of SA markedly de- 
teriorates while EO produces only small errors. 

This paper is organized as follows: In the next section 
we describe the philosophy behind the EO method, in 
Sec. ID we introduce the graph partitioning problem, 
and Sec. IV we present the algorithms and the results 



obtained in our numerical comparison of SA and EO, 
followed by conclusions in Sec. |v|. 



II. EXTREMAL OPTIMIZATION 

EO provides an entirely new approach to optimization 
[EL based on the non-equilibrium dynamics of systems 
exhibiting self-organized criticality (SOC) SOC often 
emerges when a system is dominated by the evolution of 
extremely atypical degrees of freedom [[l0| . 

A simple example of such a dynamical system which in- 
spired the development of EO is the Bak-Sneppen model 
pT[ . There, species are represented by a number between 
and 1 that indicates their "fitness," located on the sites 
of a lattice. The smallest number (representing the worst 
adapted species) at each update is discarded and replaced 
with a new number drawn from a uniform distribution 
on [0,1]. Without any interactions, all the numbers in 
the system would eventually become 1. But obvious in- 
terdependencies between species provide constraints for 
balancing the systems' fitness with that of each species: 
The change of fitness in one species impacts the fitness of 
an interrelated species. In the Bak-Sneppen model, the 
fitness values on all sites neighboring the smallest number 
at that time step are simply replaced with new random 
numbers as well p2[ . After a certain number of such up- 
dates, the system organizes itself into a highly correlated 
state known as self-organized criticality (SOC) ||. 

In the SOC state, almost all species have reached a fit- 
ness above a certain threshold. But these species merely 
possess what is referred to as punctuated equilibrium 
[0,L3|, because the co-evolutionary activity is bound to 
return in a chain reaction where a weakened neighbor 
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can undermine one's own fitness. Fluctuations that rear- 
range the fitness of many species abound and can rise to 
the size of the system itself, making any possible config- 
uration accessible. Hence, such non-equilibrium systems 
provide a high degree of adaptation for most entities in 
the system without limiting the scale of change towards 
even better states. 
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FIG. 1. Two random geometric graphs, TV = 500, with 
a — 4 (top) and a = 8 (bottom) in an optimized configu- 
ration found by EO. At a — 4 the graph barely percolates, 
with merely one "bad" edge (between points of opposite sets, 
masked by diamonds) connecting a set of 250 round points 
with a set of 250 square points, thus m opt = 1. For the denser 
graph on the bottom, EO reduced the cutsize to m opt = 13. 

EO attempts to utilize this phenomenology to obtain 
near-optimal solutions for optimization problems [p"4|| . 
For instance, in a spin glass system jl] we may consider 
as fitness for each spin its contribution to the total energy 
of the system. EO would search for ground state configu- 
rations by perturbing preferentially spins with large con- 
tributions. Like in the Bak-Sneppen model, such pertur- 
bations would be local, random rearrangements of those 
poorly adapted spins, allowing for better as well as for 
worse outcomes at each update. In the same way as 
systems exhibiting SOC get driven recurrently towards 
a small subset of attractor states through a sequence of 
"avalanches" fL5|,p|, EO can fluctuate widely to escape 
local optima while the extremal selection process ensures 



recurrent approaches to many near-optimal configura- 
tions. Especially in exploring low temperature proper- 
ties of disordered spin systems, those qualities may help 
to avoid the extremely slow relaxation behavior faced by 
heat bath based approaches pS. In that, EO provides an 
approach alternative - and apparently equally capable Q] 
- to Genetic Algorithms, which are often the only means 
to illuminate those important properties Jl7[ . The parti- 
tioning of sparse graphs as discussed here is particularly 
pertinent in preparation for similar studies on actual spin 
glasses. 

It has been observed that many optimization problems 
exhibit critical points that separate off phases with simple 
cases of a generally hard problem . Near such a criti- 
cal point, finding solutions becomes particularly difficult 
for local search methods which proceed by exploring for 
an existing solution some neighborhood in configuration 
space. There, near-optimal solutions become widely sep- 
arated with diverging barrier heights between them. It 
is not surprising that search methods based on heat-bath 
techniques like SA are not particularly successful in this 
highly correlated state [jlfj). In contrast, the driven dy- 
namics of EO does not possess any temperature control 
parameters to successively limit the scale of its fluctua- 
tions. Our numerical results in Sec. IV show that EO's 



performance does not diminish near such a critical point. 
A non-equilibrium approach like EO may thus provide 
a general-purpose optimization method that is comple- 
mentary to SA: While SA has the advantage far from 
this critical point, EO appears to work well "where the 
really hard problems are" |R8| . 



III. GRAPH PARTITIONING 

To illustrate the properties of EO and its differences 
with SA, we focus in this paper on the well-studied graph 
partitioning problem (GPP). In particular, we will con- 
sider the GPP near a phase transition where the opti- 
mization problem becomes especially difficult and pos- 
sesses many similarities with physical systems. 



A. Formulation of the Problem 

The graph (bi-)partitioning problem is easy to formu- 
late: Take N points where N is an even number, let any 
pair of two points be connected by an edge with a certain 
probability, divide the points into two sets of equal size 
N/2 such that the number of edges connecting both sets, 
the "cutsize" m, is minimal: m — m opt . The global con- 
straint of an equal division of the points between the sets 
places this problem generally among the hardest prob- 
lems in combinatorial optimization, requiring a compu- 
tational effort that would grow faster than any power of 
N to determine the exact solution with certainty la] . The 
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two physically motivated optimization methods, SA and 
EO, which we focus on here, usually obtain approximate 
solutions in polynomial time. 

For random graphs, the GPP depends on the proba- 
bility p with which any two points in the system are con- 
nected. Thus, p determines the total number of edges in 
an instance, L — pN(N — l)/2 on average, and its mean 
connectivity per point, a = p(N—l) on average. Alterna- 
tively, we can formulate a "geometric" GPP by specifying 
N randomly distributed points in the 2-dimensional unit 
square which are connected with each other if they are 
located within a distance d of one another. Then, the 
average expected connectivity a of such a graph is given 
by a = Nnd 2 . This form of the GPP has the advantage 
of a simple graphical representation, shown in Fig. |l|. 

It is known that geometric graphs are harder to opti- 
mize than random graphs |l9| . The characteristics of the 
GPP for random and geometric graphs at low connec- 
tivity appear to be very different due to the dominance 
of long loops and short loops, resp., and we present re- 
sults for both types of graphs here. In fact, in the case 
of random graphs the structure is locally tree-like which 
allows for a mean-field treatment that yields exact re- 
sults pC|-[2^]. In turn, the geometric case corresponds 
to continuum percolation of "soft" (overlapping) circles 
for which precise numerical results exist ]25| ] . Finally, we 
also try to determine the average ground state energy 
of a dilute ferro-magnetic system on a cubic lattice at 
fixed (zero) magnetization, which amounts to the equal 
partitioning of "up" and "down" spins while minimiz- 
ing the interface between both types ^T|. Here, each 
vertex of the lattice holds a ±-spin, and any two nearest- 
neighbor spins either possess a ferromagnetic coupling of 
unit strength or are unconnected. The probability that a 
coupling exists is fixed such that the average connectivity 
of the system is a. 

B. Graph Partitioning and Percolation 

Like many other optimization problems, the GPP ex- 
hibits a critical point as a function of its parameters jl8) . 
In case of the GPP we observe this critical point as a 
function of the connectivity a of graphs, with the cutsize 
TO op t as the order parameter. In fact, the critical point of 
partitioning is closely linked to the percolation threshold 
of graphs. In our numerical simulations we proceed by 
averaging over many instances of a class of graphs and 
try to reproduce well-known results from the correspond- 
ing percolation problem. Of course, using stochastic op- 
timization methods (instead of cluster enumeration) is 
neither an efficient nor a precise means to determine per- 
colation thresholds. But in turn we obtain also some 
valuable information about the scaling behavior of the 
average cost < m opt > for optimal partitions near the 



threshold that goes beyond the percolating properties of 
these graphs. 

We note, in accordance with Ref. [^I), that the criti- 
cal point separates between hard cases and easy-to-solve 
cases of the GPP. The transition is related to the corre- 
sponding percolation problem for the graphs in the fol- 
lowing manner: If the mean connectivity a is very small, 
the graph of N points consists mainly of disconnected, 
small clusters or isolated points which can be enumer- 
ated and sorted into two equal partitions in polynomial 
time with no edges between them (m op t =0). If a is large 
and the probability that any two points are connected is 
p = 0(1), almost all points are connected into one giant 
cluster with m opt = 0(7V 2 ), and almost any partition 
leads to an acceptable solution. But when p = 0(1/N), 
i. e. a — 0(1), the distribution of cluster sizes is broad, 
and the partitioning problem becomes nontrivial. Obvi- 
ously, as soon as a cluster of size > N/2 appears, m opt 
must be positive. In this sense, we observe for N — » oo 
a sharp, percolation-like transition at an a cr it with the 
cutsize TO opt as the order parameter. 

For random graphs it is known that a cluster of size N 
exists for a > 1 [^(j, but only for a>a c =2rn2w 1.386 
do we find a cluster of size > N/2 |pl| . Geometric graphs 
in D = 2 are known to percolate at about a — 4.5 p3[ , 
and we would expect a c for the GPP to be slightly larger 
than that. Also, the dilute ferro-magnet should exhibit 
a non-trivial energy when the fraction of occupied bonds 
reaches slightly beyond the critical point p c w 0.2488 for 
bond percolation on a cubic (D = 3) lattice pj|, i. e. for 
connectivities a > 2Dp c » 1.493. 

IV. NUMERICAL EXPERIMENTS 

A. Simulated Annealing Algorithm 

In SA jH), we try to minimize a global cost function 
given by / = to + p.{P\ — P2) 2 , where P\ and P2 are 
the number of points in the respective sets. Allowing the 
size of the sets to fluctuate is required to improve SA's 
performance in outcome and computational time at the 
cost of an arbitrary parameter /i to be determined. Then, 
starting at a "temperature" To, the annealing schedule 
proceeds with IN trial Monte-Carlo steps on / by tenta- 
tively moving a randomly chosen point from one set to 
the other (which changes to) to equilibrate the system. 
This move is accepted, if / improves or if the Boltzmann 
factor exp[(/ id — /now)/T] is larger than a randomly 
drawn number between and 1. Otherwise the move 
is rejected and the process continues with another ran- 
domly chosen point. After that, we set Tj = Tj_i(l — e), 
equilibrate again for IN trials, and so on, until the MC 
acceptance rate drops below A st0 p for K consecutive tem- 
perature levels. At this point the optimization process 
can be considered "frozen" and the configuration should 
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be near-optimal, to « m opt (and balanced, PI = P2). 
While SA is intuitive, controlled, and of very general 
applicability, its performance in practice is strongly de- 
pendent on the multitude of parameters which have to 
be arduously tuned. For us it is thus expedient (and 
most unbiased!) to rely on an extensive study of SA 
for graph partitioning |l9| which determined /i = 0.05, 
T = 2.5, e = 0.04, A stop = 2%, and K = 5. Ref. JI| 
set I = 16, but performance improved noticeably for our 
choice, I = 64. 

B. Extremal Optimization Algorithm 

In EO [Q, each point i obtains a "fitness" A^ = g,/ (<?i + 
hi) where gi and &i are the number of "good" and "bad" 
edges that connect that point within its set and across 
the partition, resp. (We fix A, = 1 for isolated points.) 
Of course, point i has an individual connectivity of cti — 
gi + bi while the overall mean connectivity of a graph is 
given by a = J^. on/N . The current cutsize is given by 
to = 53j&i/2. At all times, an ordered list Ai < A2 < 
... < An is maintained where A„ is the fitness of the 
point with the n-th rank in the list. 

At each update we draw two numbers, 1 < ni, n% < N, 
from a probability distribution 

P(n) ~ n- T . (1) 

Then we pick the points which are elements n\ and ri2 of 
the rank-ordered list of fitnesses. (We repeat a drawing 
of ni until we obtain a point that is from the opposite 
set than m.) These two points swap sets no matter what 
the resulting new cutsize to may be, in notable distinc- 
tion to the (temperature-) scale-dependent Monte Carlo 
update in SA. Then, these two points, and all points they 
are connected to (2a on average) , reevaluate their fitness 
A. Finally, the ranked list of A's is reordered using a 
"heap" at a computational cost oc a In N, and the pro- 
cess is started again. We repeat this process for a number 
of update steps per run that rises linearly with system 
size, and we store the best result generated along the 
way. Note that no scales are introduced into the process, 
since the selection follows a scale-free power-law distri- 
bution P(n) and since - unlike in a heat bath - all moves 
are accepted, allowing for fluctuations on all scales. In- 
stead of a global cost function, the rank-ordered list of 
fitnesses provides the information about optimal config- 
urations. This information emerges in a self-organized 
manner merely by selecting with a bias against badly 
adapted points, instead of "breeding" better ones [ pr| . 

There is merely one parameter, the exponent r in the 
probability distribution in Eq. ([[J), that controls the se- 
lection process and optimizes the performance of EO. In 
initial studies, we determined r = 1.4 as the optimal 
value for all graphs. It is intuitive that such an optimal 



value of t should exist: If r is too small, points would be 
picked purely at random with no gradient towards a good 
partition, while if r is too large, only a small number of 
points with particularly bad fitness would be chosen over 
and over again, confining the system to a poor local op- 
timum. It is a surprising numerical result that this value 
of t appears to be rather universal, independent of A, a, 
and the type of graph considered. 

C. Testbed of Graphs 

In our numerical simulations we have generated ran- 
dom and 2D geometric graphs of varying connectivity by 
choosing p or d, resp. For any instance of a graph labeled 
by a "connectivity a" , the actual connectivity not only 
varies from point to point, but also the mean connectivity 
of such graphs follows a normal distribution. (In particu- 
lar for geometric graphs it is shifted to lower values due to 
the loss of connectivity at the boundaries.) For N = 500, 
1000, 2000, 4000, 8000, and 16000, we varied the connec- 
tivity between a = 1.25 and a = 5 for random graphs, 
and a = 4 and a = 10 for geometric graphs. Then, 
for each a we generated 16 different instances of graphs, 
identical for SA and EO. On each instance, we performed 
8 (32) optimization runs for random (geometric) graphs, 
both for EO and SA. Each run, we used a new random 
seed to establish an initial partition of the points. SA's 
runs terminate when the system freezes. We terminated 
EO-runs after 200 A updates, leading to a comparabile 
runtime between both methods. 

100 , , , , , , 
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FIG. 2. Typical runtime comparison of SA and EO for the 
ferromagnet on a 200MHz Pentium. Runtimes generally rise 
oc TV, fall with the connectivity a for SA (filled symbols), 
but rise oc a for EO (unfilled symbols). Circles refer to an 
average runtime of graphs with TV = 512 = 8 3 points, squares 
to N = 1728 = 12 3 , diamonds to N = 4096 = 16 3 , and 
triangles to TV = 8000 = 20 3 . 

For the dilute ferro-magnet, we fixed the number of 
couplings to obtain a specific average connectivity a. 
Those couplings were then placed on random links be- 
tween nearest-neighbor spins to generate an instance. We 
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used 16 instances, and 16 runs for each, at connectivities 
1.6 < a < 4. Here, we only used 100TV updates for 
EO, and the temperature length of 16TV recommended 
in Ref. ]l^ | but with a higher starting temperature for 
SA to optimize performance at a comparable runtime for 
both methods, as shown in Fig. ||. 
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FIG. 3. Plot of the error of SA relative to the best re- 
sult found on (a) random graphs (top), (b) geometric graphs 
(middle), and (c) the dilute ferromagnet (bottom) as func- 
tion of the mean connectivity a. While the error near a c only 
increases slowly for random graphs, it appears to increase lin- 
early with TV for the ferromagnet and for geometric graphs. 
At fixed but large connectivities, SA increasingly gains on EO 
for rising N. 



D. Evaluation of Results 



1. Comparison of EO and SA 

We evaluate the performance of SA and EO separately. 
For each method, we only take its best result for each 
instance and average those best results at any given con- 
nectivity a to obtain the mean cutsize for that method 
as a function of a and TV. To compare EO and SA, we 
determine the relative error of SA with respect to the 
best result found by either method for a > a c . Figs. ||a- 
c show how the error of SA diverges with increasing TV 
near to a c for each class of graphs. 

Depending on the type of graph under consideration, 
the quality of the SA results may vary. The data for 
random graphs in Fig. |3|a only shows a relatively weak 
deficit in SA's performance relative to EO. Near a c = 
2 In 2 = 1.386, SA's relative error remains modest, and 
only grows very weakly with increasing TV. For large con- 
nectivities a, SA quickly becomes the superior method 
for random graphs, which may be due to their increas- 
ingly homogeneous structure (i. e. low barriers between 
optima) that does not favor EO's large fluctuations. On 
the other hand, the averages obtained by EO appear to 
be very smooth (see the scaling in Fig. [|a) whereas the 
apparent noise in Fig. [|a indicates large variations be- 
tween instances for the SA results. 

The very rugged structure of geometric graphs near 
the percolation threshold (see Fig. |l|a), a c w 4.5, is most 
problematic for SA, leading to huge errors which appear 
to increase linearly with TV. Barriers between optima are 
high within each graph, now favoring EO's propensity 
for large fluctuations. On the scale of Fig. ||b, error bars 
attached to the data (which we have generally omitted) 
would hardly be significant. But experience shows that 
both methods exhibit large variations in results between 
instances which is in large part due to actual variations 
in the structure between geometric graphs. 

The results for the dilute ferromagnet exhibit a mix 
of the two previous cases. Since the points are arranged 
on a D = 3-lattice, the structure of these graphs is def- 
initely geometrical, but local connectivities are limited 
to the 2D = 6 nearest neighbors that each point pos- 
sesses. Again, SA's error is huge and appears to diverge 
about linearly near the threshold, a c w 1.5. But due to 
the limited rage of connectivities, graphs soon become 
rather homogeneous for increasing a which in turn ap- 
pears to favor SA away from the transition, especially 
for larger graphs. (For larger TV, any local structure gets 
quickly averaged out due to the local limits on the con- 
nectivity, whereas an unlimited range of local structures 
can emerge in the geometric graphs above.) 



2. Scaling of EO-Data near the Transition 
For the data obtained with EO, we make an Ansatz 



5 



(ro opt ) ~ N v (a - a c y 



(2) 



to scale the data for all N onto a single curve, shown in 
Figs. ^a-c. From the scaling Ansatz we can extract an 
estimate for a c to compare with percolation results as a 
measure of the accuracy of the data obtained with EO. 
Furthermore, we also obtain a numerical estimates for 
the exponents v and (3 which characterize the transition. 
The exponent v : describing the finite-size scaling behav- 
ior, could be infered from general, global properties of a 
class of graphs. For instance, v = 1 for random graphs 
because any global property of these graphs is extensive 
pjj . On the other hand, the exponent (3, describing the 
scaling of the order parameter near the transition, is re- 
lated to the intricate structure of the interface needed 
to separate points into equal-sized partitions. Thus, we 
would expect (3 to be nontrivial even for random graphs. 
(To our knowledge, no previous predictions for these ex- 
ponents exist.) 

For random graphs in Fig. |Ja, the scaling Ansatz in 
Eq. (||) is particularly convincing. We verify that v = 1 
and obtain f3 = 1.2. From the fit we obtain also a c « 
1.30, just slightly below the exact value of 1.38 |pl]| . The 
fit produces an error of about ±0.1 in the determination 
of (3, which would ignore any error received through the 
limited number of instances averaged over, or any bias 
due to the shortcomings of EO to approach the exact 
optima. A satisfactory fit in turn would indicate that 
such errors should be negligible. 

For geometric graphs in Fig. [|b, we found the best scal- 
ing for v = 0.6. Since we used only 16 different instances 
to average over at each N and a, the data gets very noisy 
for larger connectivities due to large fluctuations in the 
optimal cutsizes between those instances and/or EO's in- 
ability to find good approximations. We chose to fit only 
points up to a = 7 and obtained (3 = 1.4 and a c w 4.1, 
even smaller than the critical value for percolation, 4.5. 
Obviously, the obtained values are very poor, but at least 
indicate EO's ability to approximate the optimal cutsizes 
with bounded error near the transition. 

The data for the dilute ferromagnet in Fig. ||c appears 
to scale well for v — 0.75. Since EO's performance is 
falling behind that of SA for a > 3 we only fit to smaller 
values of a and obtain f3 = 1.15 and a c = 1.55, as desired 
just slightly larger than the value for percolation, 1.49. 
We estimate the error from the fit for each of these values 
to be about ±0.05. 



3. Fixed-Valence Graphs 

Finally, we have also performed a study on graphs 
where points are linked at random, but where the connec- 
tivity a at each point is fixed. These graphs have been 
investigated previously theoretically |2^j2l| and numeri- 
cally using SA |27j]. While a now is fixed to be an integer, 



we can not tune ourselves arbitrarily close to a critical 
point. Furthermore, the problem is non-trivial only when 
a > 3. These graphs have the property that at a given a 
and N the optimal cutsizes between instances vary little, 
and only few instances are needed to determine (m opt ) 
with good accuracy. 
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FIG. 4. Scaling plot of the data from EO according to Eq. ^ 
for (a) random graphs (top), (b) geometric graphs (middle), 
and (c) the dilute ferromagnet (bottom) as function of the 
mean connectivity a. The scaling parameters and fits are 
discussed in the text. 

In our simulations we found that for larger values of 
a, SA and EO both confirm the results in Ref. |27]] quite 
well. But for a = 3, the lowest non-trivial connectivity, 
we did observe significant differences between EO and 
the study in Ref. |27[] . Ref. |27|, by averaging 5 instances 
each at various values of N (450 < N < 4000) , found a 
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normalized average energy 



E 



-1 



4(m opt ) 
aN 



(3) 



of —0.840, presumably correct to the digits given. We 
found by averaging over 32 instances, using 8 EO runs 
on each, for N = 1024, 2048, and 4096 that E = 
-0.844 ±0.001. But this result is still significantly higher 
than some theoretical predictions f26| , plj| , and we will in- 
vestigate whether longer runtimes may further reduce the 
cutsizes for these graphs B . 



V. CONCLUSIONS 

In this paper we have demonstrated that Extremal 
Optimization (EO), a new optimization method derived 
from non-equilibrium physics, may provide excellent 
results exactly where Simulated Annealing (SA) fails. 
While further studies will be necessary to understand 
(and possibly, predict) the behavior of EO, we have used 
it here to analyze the phase transition in the NP-hard 
graph partitioning problem. The results illustrate con- 
vincingly the advantages of EO and produce a new set 
of scaling exponents for this transition for a variety of 
different graphs. 

I thank A. Percus, P. Cheeseman, D. S. Johnson, D. 
Sherrington, and K. Y. M. Wong for very helpful discus- 
sions. 
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